Things between Lexicon and Grammar

نویسنده

  • Yuji Matsumoto
چکیده

A number of grammar formalisms were proposed in 80’s, such as Lexical Functional Grammars, Generalized Phrase Structure Grammars, and Tree Adjoining Grammars. Those formalisms then started to put a stress on lexicon, and were called as lexicalist (or lexicalized) grammars. Representative examples of lexicalist grammars were Head-driven Phrase Structure Grammars (HPSG) and Lexicalized Tree Adjoining Grammars (LTAG). While grammars and lexicons were two major linguistic resources of syntactic processing of natural languages, lexicons began to play an important role in language processing. Things have changed from early 90’s, when large scale language resources became available and corpus-based research started to dominate almost all aspects of natural language processing (NLP). Partof-speech taggers and syntactic parsers are the most well-studied topics in corpus-based research. Various parsers, based either on phrase structure grammars or on dependency structures, have been developed, applying various machine learning techniques on syntactically annotated corpora. State-ofthe-art parsers developed in this way have achieved very good performance. Those trends are also beneficial to lexicalist grammars since parsing with those grammar formalisms is amenable to phrase structure-based parsing through abstraction of grammatical schemata or a derivation process with those grammar formalism (i.e., a derivation tree) can be considered to correspond to a word dependency tree. Recent trends in NLP have started to target diversely spread areas that require semantic and pragmatic information. Some areas like social media analysis, such as twitter or blog text analysis, have a more preference to getting semantic or sentiment information than syntactic information. Though this trend is attracting people’s attention and is getting growing importance, still syntactic analysis keeps to play an important role. Simple extension of annotated corpora and lexical statistics will not be able to skyrocket parsers’ performance. Improvement of parsing accuracy especially that of long sentences requires to tackle problems that are not on the current main stream of parser development. In this talk, I will take up three issues that lie between grammars and lexicons: Coordination structures, multiword expressions and complex sentence patterns. I will first give a brief overview of syntactic processing in past two/three decades, then will talk about the issues one by one especially about our experiences related with them. Finally, I will consider future directions of sentence analysis taking those into account.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Montagovian Generative Lexicon

Generative Lexical Semantics aims, among other things, to model cocompositional meanings that classical Montague semantics is unable to express. This paper outlines a simple extension of Montague semantics which amends the standard compositional mechanisms for the use of specific terms provided by the lexicon, in order to account for this theory. We hope that such a logical system would prove b...

متن کامل

Causatives and the empty lexicon: A minimalist perspective

What is the constitution of the meaning of morphemes (lexical concepts)? According to most theories, such meanings have a molecular or holistic internal structure: prototypes, exemplars, semantic networks, complex schemata, scripts, and even classical definitions. Recently, however, contrary opinions have arisen in cognitive science suggesting that lexical concepts are not semantically structur...

متن کامل

Constraints on the computational component vs. grammar in the lexicon: a discussion of Bates & Goodman.

In The emergence of language, it is argued that much of what generative linguistics has characterized as ‘rules’ in fact can be derived from domaingeneral cognitive mechanisms. One example of this perspective is Bates & Goodman’s contribution ‘On the emergence of grammar in the lexicon’, which Sabbagh & Gelman say offers ‘...a series of compelling arguments detailing how development and early a...

متن کامل

Lexicon-Grammar And The Syntactic Analysis Of French

A lexicon-grammar is constituted ot the elementary sentences of a language. Instead of considering words as basic syntactic units to which grammatical information is attached, we use simple sentences (subject-verb-objects) as dictionary entries, Hence, s full dictionary item is a simple sentence with a description of the corresponding d is t r ibut ional and t ransformat ional propert ies, The ...

متن کامل

Generalizing Subcategorization Frames Acquired from Corpora Using Lexicalized Grammars

This paper presents a method of improving the quality of subcategorization frames (SCFs) acquired from corpora in order to augment a lexicon of a lexicalized grammar. We first estimate a confidence value that a word can have each SCF, and create an SCF confidence-value vector for each word. Since the SCF confidence vectors obtained from the lexicon of the target grammar involve co-occurrence te...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012